Less Is More: Pay Less Attention in Vision Transformers
نویسندگان
چکیده
Transformers have become one of the dominant architectures in deep learning, particularly as a powerful alternative to convolutional neural networks (CNNs) computer vision. However, Transformer training and inference previous works can be prohibitively expensive due quadratic complexity self-attention over long sequence representations, especially for high-resolution dense prediction tasks. To this end, we present novel Less attention vIsion (LIT), building upon fact that early layers still focus on local patterns bring minor benefits recent hierarchical vision Transformers. Specifically, propose where use pure multi-layer perceptrons (MLPs) encode rich stages while applying modules capture longer dependencies deeper layers. Moreover, further learned deformable token merging module adaptively fuse informative patches non-uniform manner. The proposed LIT achieves promising performance image recognition tasks, including classification, object detection instance segmentation, serving strong backbone many Code is available at https://github.com/zip-group/LIT.
منابع مشابه
Less is more: compact genomes pay dividends.
In 1993, Sydney Brenner, like many others, recognized that vertebrates are distinct in their morphology and development and that access to the complete sequence of a vertebrate genome would yield valuable insights into the biology of higher species not obtainable from genome studies of yeast, fly, or even the nematode. Moreover, at that time it was not possible, through sequencing technology, t...
متن کاملLess is more… (more or less…).
In April 1981 Xerox introduced the Star 8010 workstation, the first commercial system with a Graphical User Interface (GUI) and the first to use the “desktop” metaphor to organize a user’s interactions with the computer. Despite the perception of huge progress, from the perspective of design and usage models, there has been precious little progress in the intervening years. In the tradition of ...
متن کاملLess Is More, More the Merrier, or More From Less?
P hysicians deal with uncertainty all the time and chest pain in the emergency department (ED) is a typical example. Traditionally, coronary artery disease (CAD), pulmonary embolism (PE), and aortic dissection can present as chest pain, and the consequences of a missed diagnosis can be devastating, with the potential for rapid deterioration, and serious risk of morbidity and mortality. Moreover...
متن کاملLess Is More?
Judges in the United States, the United Kingdom, and Canada have ruled that witnesses may not wear the niqab—a type of face veil—when testifying, in part because they believed that it was necessary to see a person’s face to detect deception (Muhammad v. Enterprise Rent-A-Car, 2006; R. v. N. S., 2010; The Queen v. D(R), 2013). In two studies, we used conventional research methods and safeguards ...
متن کاملLess is more.
Copyright 2012 by the National Academy of Sciences. All rights reserved. The views expressed in this commentary are those of the author and not necessarily of the author’s organization or of the Institute of Medicine. The commentary is intended to help inform and stimulate discussion. It has not been subjected to the review procedures of the Institute of Medicine and is not a report of the Inst...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i2.20099